A SECOND ORDER SMOOTH VARIATIONAL PRINCIPLE ON 
RIEMANNIAN MANIFOLDS 



DANIEL AZAGRA AND ROBB FRY 

Abstract. We establish a second order smooth variational principle valid 
for functions defined on (possibly infinite-dimensional) Riemannian manifolds 
which are uniformly locally convex and have a strictly positive injectivity ra- 
dius and bounded sectional curvature. 



1. Introduction and main result 

It is well known that a continuous function defined on an infinite-dimensional 
manifold (or on a Banach space) does not generally attain a minimum in situations 
in which there would typically exist minimizers if the function were defined on a 
finite-dimensional manifold (for instance when the infimum of the function in the 
interior of a ball is strictly smaller than the infimum of the function on the boundary 
of the ball). In fact, as shown in pQ, the smooth functions with no critical points 
are dense in the space of continuous functions on every Hilbert manifold (this result 
may be viewed as a strong approximate version for infinite dimensional manifolds 
of the Morse-Sard theorem). So, when we are given a smooth function on an 
infinite-dimensional Riemannian manifold we should not expect to be able to find 
any critical point, whatever the overall shape of this function is, as there might be 
none. 

This is quite inconvenient because many important problems of PDEs and of 
optimization admit equivalent formulations as minimization (or variational) prob- 
lems, that is, one is given a continuous (or differentiable, or convex, or lower semi- 
continuous, etc) functional defined on a functional space, or on a (usually infinite- 
dimensional) manifold, and one is asked to find a minimizer of the functional, which 
will provide a solution of the initial problem. Under several classes of rather re- 
strictive assumptions (for instance convexity of the functional and reflexivity of the 
space) one may be able to show existence of minimizers, but in many natural cases 
(such as the problem of finding minimizing geodesies in infinite-dimensional Rie- 
mannian manifolds, to name one) exact minimizers do not generally exist, and one 
is thus forced to look for approximate minimizers, which will provide approximate 
solutions of the problem. 

This is why perturbed minimization principles, or variational principles, are im- 
portant. A variational principle typically asserts that, for a given lower semicontin- 
uous function / : X — > R, defined on a given space X, and such that / is bounded 
below, there exists a function ip : X — > K belonging to a given class C such that 
/ — if attains a global minimum and tp can be prescribed to be small in some sense. 
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Historically, Ekeland's variational principle [9] was the first perturbed minimization 
principle discovered, and one of the most powerful and widely applicable, because 
it holds for any complete metric space X; in this case the perturbing class C is the 
set of small scalar multiples of distance functions to points, that is the perturbing 
function ip has the shape of a flat cone. One of the many striking applications of 
Ekeland's principle was to show the existence of almost minimizing geodesies on 
complete infinite-dimensional Riemannian manifolds, that is an approximate ver- 
sion of the Hopf-Rinow theorem (as we recalled above, an exact analogue is false 
in infinite dimensions). 

Despite the generality and power of Ekeland's result, there are some limitations 
to the applicability of this minimization principle due to the fact that the perturbing 
functions ip are not differentiable. Indeed, there are many situations (for instance 
when one wants to build a theory of subdifferentiability, in nonsmooth analysis, and 
in the study of viscosity solutions to Hamilton- Jacobi equations) in which one needs 
the perturbing function ip to be differentiable. In order to remedy this deficiency 
Borwein and Preiss [5] established a smooth variational principle in which the space 
X is a Banach space with a C 1 smooth norm, and the perturbing functions (p are 
smooth functions which can be taken to be arbitrarily small and with an arbitrarily 
small Lipschitz constant. Later on, Deville, Godefroy and Zizler [HIE], by using 
a new method of proof based on the use of Baire's category theorem, were able 
to extend this smooth variational principle to the class of all Banach spaces with 
smooth bump functions, as well as for higher orders of smoothness. 

Smooth variational principles on Riemannian manifolds were not studied until 
very recently. In [2] a Riemannian version of the Deville-Godefroy-Zizler smooth 
variational principle was established within the class of complete Riemannian man- 
ifolds which are uniformly bumpable. This was applied to developing a theory 
of nonsmooth analysis and to the study of viscosity solutions to Hamilton- Jacobi 
equations on Riemannian manifolds. Later, in [3], the authors showed that the 
assumption on uniform bumpability can be dispensed with, by establishing that 
every Lipschitz function / defined on a Riemannian manifold can be uniformly ap- 
proximated by a sequence of C°° smooth Lipschitz functions (/„) in such a way 
that the Lipschitz constants of /„ converge to the Lipschitz constant of /. 

In this paper we will prove a second order smooth variational principle valid 
for functions defined on every complete Riemannian manifold of bounded sectional 
curvature, uniformly locally convex, and with a positive injectivity radius. In fact 
the result is valid (see Theorem 13 . 71 below) for every complete Riemannian manifold 
which is second order uniformly bumpable according to Definition 12.11 given below. 
The question whether or not every complete Riemannian manifold is second order 
uniformly bumpable is open, and seems to be much more subtle than its first order 
analogue. 

Let us state our main result. 

Theorem 1.1. Let M be a (possibly infinite- dimensional) complete Riemannian 
manifold. Assume that the sectional curvature of M is bounded, and that M is 
uniformly locally convex and has a positive injectivity radius. Then, for every lower 
semicontinuous function f : M — > (—00,00] which is bounded below, with f ^ +00, 
and for every e > 0, there exists a C 2 smooth function ip : M — > M such that 

(1) / — (p attains its strong minimum on M ; 

(2) |M|oo<e; 
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(3) Halloo < e; 

(4) ||dV||oo <£■ 

Recall that a function g is said to attain a strong minimum at a point xq provided 
g(xo) — inf^gM g(x) and lim^^.^ d(x n ,Xo) — whenever (x n ) is a minimizing 
sequence (that is, if linin^oo g(x n ) = g(xo)). 

In the statement of Theorem 11.11 we used the following notation: 

H^Hoo = sup \ip(x)\ 
\\d<p\\oo = sup lld^a;) || x 
||dVllt» = sup \\d 2 ip(x)\\ x , 



where 



||d<^(a;)|| = sup dp(x)(v) = sup (V(f(x),v) x , 

v£TM x ,\\v\\ x =l veTM x ,\\v\\ x =l 



and 

\\d 2 ip(x)\\ x = sup \d 2 (p(x)(v,v)\. 
«eTAf !t ,||ii|| !e =i 

Recall also that the Hessian D 2 ip of a C 2 smooth function ip on M is defined by 

D 2 l p(X,Y) = (V x Vtp,Y), 

where Vip is the gradient of <p; X, Y are vector fields on M, and Vx^ is the 
covariant derivative of a vector field Z with respect to a vector field X. The 
Hessian is a symmetric tensor field of type (0, 2) and, for a point p e M , the 
value D 2 ip(X,Y)(p) only depends of / and the vectors X(p),Y(p) £ TM P . So we 
can define the second derivative of cp at p as the symmetric bilinear form d 2 ip(p) : 
TM P x TM P -> K 

(«,«;) ^ dV(p)(«,^) := £>V*,^)(P), 

where X, V are any vector fields such that X(p) — v, Y(p) — w. A very useful way 
to compute d 2 ip(p)(v 7 v) is to take a geodesic 7 with 7'(0) = v and calculate 

d 2 

^2^(7(*))|t=o, 

which equals d 2 ip(p)(v,v). We will often write d 2 ip(p)(v) 2 instead of d 2 (p(p)(v, v). 

Let us fix some notation and terminology that we will be using throughout 
the paper. M = (M, g) will always be a (possibly infinite-dimensional) complete 
Riemannian manifold, and g(x) = (-,-) x its Riemannian metric. We refer to [11] 
for a basic introduction to infinite-dimensional Riemannian manifolds. However, 
instead of Lang's notation we will employ a more classic notation, such as that of 
Do Carmo's book [8]. The letters X, Y, Z, V, W will stand for smooth vector fields 
on M, and VyJ will always denote the covariant derivative of X with respect to 
Y. The Riemannian curvature of M will be denoted by R. Recall that the value 
of R(X, Y)Z := VxVyZ - VyV x Z - V[x,Y]% at a P oint P € M onl y depends on 
the values of X, Y, Z at p, and that the sectional curvature of M at a point p with 
respect a plane spanned by two vectors v, w £ TM p is defined by 

K^vM= {R \ V fl V) 
\v A w r 
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where \v A w\ is the area of the parallelogram defined by u, v in TM P . 

Geodesies in M will be denoted by 7, a, and their velocity fields by 7', a' . If X 
is a vector field along 7 we will often denote X'(t) — jrrX(t) — V T <(t)X(f). Recall 
that X is said to be parallel along 7 if X'(t) = for all t. The Riemannian distance 
in M will always be denoted by d(x, y) (defined as the infimum of the lengths of all 
curves joining x to y in M). 

We will often identify the tangent space of M at a point x, denoted by TM X , 
with the cotangent space at x, denoted by TM*. The space of bilinear forms 
on TM X (respectively symmetric bilinear forms) will be denoted by C 2 (TM X ) or 
£ 2 {TM X ,R) (resp. C 2 S (TM X ) or £ 2 (TM X ,R)). Also, we will denote by T 2 , S (M) the 
tensor bundle of symmetric bilinear forms, that is 

T 2 . S (M)= |J C 2 S (TM X ,R), 

and T 2>S (M) X = C 2 S (TM X ,R). 

We will make extensive use of the exponential mapping exp^ throughout the 
paper. Recall that for every x € M there exists a mapping exp x , defined on a 
neighborhood of in the tangent space TM X , and taking values in M, which is 
a local diffeomorphism and maps straight line segments passing through onto 
geodesic segments in M passing through x. 

By im(x) we will denote the inactivity radius of M at x, that is the supremum 
of the radius r of all balls B(0 X , r) in TM X for which exp^, is a diffeomorphism from 
B(0 X , r) onto B(x, r). The number im{x) is strictly positive for every x. Similarly, 
i(M) will denote the global injectivity radius of M, specifically i(M) = mi{iM(x) : 

x e M}. 

We will also need to recall some results about convexity in Riemannian manifolds. 
We say that a subset U of a Riemannian manifold is convex if given x,y S U there 
exists a unique geodesic in U joining x to y, and such that the length of the geodesic 
is d{x,y). Every Riemannian manifold is locally convex, in the sense that for every 
x E M, there exists c > such that for all r with < r < c, the open ball 
B(x,r) = exp x B(O x ,r) is convex (this is Whitehead's Theorem). The convexity 
radius of a point ieMma Riemannian manifold M is defined as the supremum 
in K+ of the numbers r > such that the ball B(x,r) is convex. We denote 
this supremum by cm{x) (by the result we have just mentioned, cm(%) is strictly 
positive for every x) . 

Whitehead's theorem gives rise to the notion of uniformly locally convex mani- 
fold: we say that a Riemannian manifold M is uniformly locally convex provided 
that there exists c > such that for every x € M and every r with < r < c 
the ball B(x,r) — exp x B(0 X , r) is convex. This amounts to saying that the global 
convexity radius of M (defined as c(M) := infjcM^) : % € M}) is strictly positive. 

It is also worth noting that the functions x t— > im{x) and x 1— * cm{x) are contin- 
uous, see [TO] . 

As noted above, the Hopf-Rinow theorem fails in infinite dimensions, and hence 
generally geodesies minimizing the distance between two given points do not exist in 
infinite-dimensional Riemannian manifolds. But, if M is locally uniformly convex 
and i{M) > 0, they do always exist locally. In fact they exist in a uniformly 
local way: for any r > with r < min{i(M), c(M)} and for all points x,y with 
d(x, y) < r, there is a unique (up to reparameterizations) minimizing geodesic 7 
connecting x and y (7 is defined by -f(t) = exp x (tw y ) , where w y — exp~ 1 (y)). 
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Moreover, d(x, y) is given by d(x, y) 2 = \\ exp~ 1 (y)\\ x , see [2j Proposition 3.9]. This 
will be extensively used in Section 2 of the paper. 

Let us also recall that for a minimizing geodesic 7 : [0, 1] — > M connecting x to 
y in M, and for a vector v £ TM X there is a unique parallel vector field P along 7 
such that P(0) = v. The mapping TM X 3 v P(£) g TM y is called the parallel 
translation of v along 7 and is a linear isometry from TM X onto TM y which we will 
denote by L xy . 

For a basic theory of Jacobi fields on infinite-dimensional Riemannian manifolds 
(and for any other unexplained terms of Riemannian geometry used in Section 2), 
we refer the reader to [TT] . 

Let us finish this introduction by providing a typical application of the smooth 
variational principle, namely to show the existence and density of the set of points 
of subdifferentiability (of second order, in our case). For a lower semicontinuous 
function / : M — > (—00, +00] we define the second order subjet of / at a point 
x £ M as the set 

J 2 '~f(x) — {(dip(x),d 2 ip(x)) : if £ C 2 (M, R), f—ip attains a local minimum at x}. 

Corollary 1.2. Let M be as in the statement of Theorem li.il and let f : M — > 

(—00, +00] be a lower semicontinuous function. Then the set {z £ M : J 2 -~ f(z) 7^ 
0} is dense in the set {x S M : f(x) < 00}. 

Proof. The proof is an easy consequence of Theorem ll.il Indeed, the same proof 
as in[2l Proposition 4.17] holds if we replace D~f(x) with J 2 -~f(x), and C 1 with 
C 2 . ' □ 



2. Second order uniformly bumpable manifolds 

In this section we will show that all complete Riemannian manifolds which are 
uniformly locally convex and have bounded sectional curvature and a strictly posi- 
tive injectivity radius are second order uniformly bumpable, meaning the following. 

Definition 2.1. We will say that a Riemannian manifold M is second order uni- 
formly bumpable provided there exist R > 1, r > such that for every z £ M and 
for every 8 £ (0, r) there exists a C 2 smooth function b : M — » [0, 1] such that 

(i) b(z) = 1 

(ii) b{x) — whenever d(x, z) > 8 

(iii) Hd&Hoo <R/8 

(iv) ||d 2 6||oo < R/8 2 . 

The key to the proof of the main result of this section is the following estimation 
for the norm of the second derivative of the square of the distance function, for 
which we have found no reference (of course there are well known estimations in 
the finite-dimensional case, see for instance [12l Lemma 2.9 and Exercise 4, page 
153], but they all depend on results established exclusively in finite dimensions, and 
of which we have found no infinite-dimensional versions in the literature). 

Proposition 2.2. If M is a Riemannian manifold whose sectional curvature K is 
bounded, say \K\ < Kq, and c(M) > 0, i{M) > 0, then, for every r with < r < 
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min{i (M), c(M), 7r/2v / ^o}, and for every z G M, the function <p(x) := d(x, z) 2 is 
C°° smooth on B(z, r) and its second derivative satisfies 

\\d 2 y{x)\\ x < (2+l i K d(x,zf) 

for all x G B(z, r). 

In the proof of this Proposition we will have to use some well known results 
about the second variation of the arc length and the energy functionals. Let us 
briefly review the facts that we will be using. 

Fix a number r such that < r < m\n{i{M),c(M) 1 -K and take two 
points x,xo G M with d(x,x ) < r. Let 7 be the unique minimizing geodesic, 
parameterized by arc-length, connecting x$ to x. Denote £ — d(x,xo), the length 
of 7. Consider a(t,s), a smooth variation of 7, that is a smooth mapping a : 
[0,£] x [-£,£] -> M such that a(t,0) = j{t) for all t G [0,4 Consider the length 
and the energy functionals, defined by 

L(s) = L{a„) = / \\a' s (t)\\dt 
Jo 

and 

E(s)=E(a s )= f \\a' s (t)fdt, 
Jo 

where a s is the variation curve defined by a s (t) = a(t,s) for every t G [0,£]. 
According to the Cauchy-Schwarz inequality (applied to the functions / = 1 and 
g(t) = \\a' s (t)\\ on the interval [0,£]) we have that 

L(s) 2 < £E(s), (0) 

with equality if and only if ||a^(t)|| is constant. In particular we have 

L(0) 2 =£E(0) 

because ao = 7 is a geodesic. 

Therefore, if we furthermore assume that a s is a geodesic for each s (that is a 
is a variation of 7 through geodesies) we have that 

L(s) 2 =lE(s) (1) 

for every s G [— £, e]. 

Now define <p(x) = d(x, x ) 2 . The function is C°° smooth on B(x 0l r) because 
r < min{i(M), c(M)}, so on this ball exp^ 1 is a C°° diffcomorphism, and 

d(x,x a ) 2 = Hexp-^aOHxo- 
Let us take a vector v G TM X . We want to calculate 

dMx)(v) 2 , 

which is given by 

d 2 

-^ip{a x (s))\ s=0l 

where a x (s) = exp a .(sw). To this end let us denote by a s : [0,£] — > M the unique 
minimizing geodesic joining the point xq to the point a x (s) (notice that now, for 
s 7^ 0, a s is not necessarily parameterized by arc- length), and let us define a : 
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[0,£] x [— e,e] — > M by a(i, s) = a s (t). Then a is a smooth variation through 
geodesies of j(t) = a(t, 0) and we have 

<p{a x {s))=L{sf =lE{s), 

and therefore 

d 2 v(x)(v) 2 = £E"(0). (2) 

If we denote X(t) = da(t, 0)/ds, the variational field of a, then the formula for the 
second variation of energy (see [11] , Chapter XI) tells us that 

Ie"(0)=J o ({X',X') (R( 7 ',X) 7 ',X))dt + (R^( t ,0), 7 '(t))\tZi, (3) 
or equivalently 

^"(0) = - f{X, X"+R{i, X)i)dt + (X(t), X>(t))\tzi + (R^(t, ), i{t)) \\% 

(3') 

where we denote X 1 = V 7 /(t)X, and X" = Vy(t)Jf'. 

Note that, since the variation field of a variation through geodesies is always a 
Jacobi field, and since the points x and xq are not conjugate (recall that r < i(M)), 
X is the unique Jacobi field along 7 satisfying that X(0) — 0, X(£) = v, that is X 
is the unique vector field along 7 satisfying 

X"(t) + R(<y'(t),X(t))j'(t) = 0, and X(0)=0, X(£) = v, 

where R is the curvature of M. 

On the other hand, since the curve s — * a{l,s) — o x (s) and s — + a(0, s) = xq 
are geodesies, we have that 

<^M),Y( t )>« = o. 

These observations allow us to simplify the formulas (3) and (3') by dropping the 
terms that vanish, thus obtaining that 

V(0) = J ((X', X') (i?( 7 ', X))dt, (4) 

and also 

l -E"{0) = {X{£),X\l)). (4') 

Recall that the right-hand side of (4) is called the index form and is denoted by 
I(X,X). 

By combining (2) and (4) we get 

I 4 

d 2 <p(x )(v) 2 = 21 / ((X', X') - (i?( 7 ', X)i, X))dt, (5) 
Jo 

or equivalently 

<P<p(x )(v) 2 =2£(X'(e),X(£)). (5') 

We are going to use these formulas to deduce the estimation in the statement of 
Proposition 12.21 But we will need to combine them with a couple of facts about 
Jacobi fields, and with the Rauch Comparison Theorem. First, we must use the 
following. 
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Lemma 2.3. Let 7 : [0,^] — * M be a geodesic whose length, £, is the distance 
between its end points, and let Y be a field along 7 with Y(0) = and Y(£) = 0. 
Then 

I(Y, Y) > 0. 

Proof. This result is stated and proved in [TTJ Theorem 1.7 of Ch. XI] under the 
additional assumption that Y is orthogonal to 7, but it is true for any Y (and in 
fact almost the same proof holds, with an additional remark). We will write the 
whole proof for the reader's convenience. Define 

f3(t,s) = exp l(t) (sY(t)) 

for < t < £ and < s < e, and for a sufficiently small e > 0. For each s, 
Ps(t) :— (3{t, s) is a curve, not necessarily a geodesic, joining the end points of 7, 
that is 

0.(0) =7(0), and&(€)= 7 M 

(because of the assumption that Y(0) = 0, Y{£) — 0). Moreover, ft(t,Q) = 7(t), so 
(3 is a variation of 7 which leaves the end points fixed, and the variation field of (3 
is 

os 

Using the formula for the second variation of energy [TTJ Chapter XI] (and taking 
into account that the curves s 1— > /3(0, s) and s >—>■ (3{i, s) are geodesies, in fact 
points), we have that 

V(0) = J ({Y', Y') - {R{i,Y)i, Y))dt := I{Y, Y). (6) 

On the other hand, according to equation (0), and bearing in mind that 7 is a 
minimizing geodesic, we have that 

£E(0) = L(0) 2 < L{sf < lE{s), 

hence 

E(0) < E{s). 

That is, a geodesic which minimizes length also minimizes energy. Therefore we 
have 

E"(Q) > 0, 

which, combined with equation (6) yields I(Y, Y) > 0. □ 
This Lemma allows us to improve Corollary 1.8 of [TTJ Chapter XI] as follows. 

Proposition 2.4. Let 7 : [0, £] — > M be a geodesic whose length, £, is the distance 
between its end points. Let X be a Jacobi field along 7, and Z any smooth vector 
field along 7 such that X(0) = Z(0) and X(£) = Z(£). Then 

I(X,X) < I{Z, Z). 

In summary, among all vector fields along 7 with the same boundary conditions, 
the unique Jacobi field along 7 determined by those conditions minimizes the index 
form. 
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Proof. The same proof as in [ITJ Chapter XI, Corollary 1.8] holds (the additional 
assumption in that X — Z is orthogonal to 7 is only made to be able to apply 
Theorem 1.7 in Chapter XI of [11] , which we have just improved in the above Lemma 
by dispensing with that extra assumption). See also Corollary 10 in Chapter 8 of 
[13j , where the finite-dimensional version of this Proposition is established with the 
same proof (which only uses the finite-dimensional version of the above Lemma and 
the second variation formula). □ 

As noted above, we will also need to use an infinite-dimensional version of the 
Rauch Comparison Theorem, which we next state for the reader's convenience. 

Theorem 2.5 (Rauch Comparison Theorem). Let M and M be Riemannian man- 
ifolds of the same dimension, which may be infinite. Let 7 ( resp. j) be geodesies in 
M (resp. M), parameterized by arc-length, and defined on the same interval [0,£]. 
Let J (resp J) be Jacobi fields along these geodesies, orthogonal to 7 (resp. j). 
Assume that 

(1) J(0) = J(0) = 0, and J(t), J{t) 7^ forO <t<£. 

(2) ||J'(0)|| = ||J'(0)||. 

(3) The length of 7 is the distance between its end points. 

(4) The sectional curvature of M is less than or equal to the sectional curvature 
ofM. 

Then, for all t 6 [0,£], we have 

\\J(t)\\<\\J(t)l 

and _ 

(jy),jjt)) < (j'(t),j(t)) 
(j(t),j(t)) ~ (J(t),j(t)) ' 

A proof of this result can be found in [TTl p. 319]. The last inequality does not 
appear in Lang's statement of [TTJ Chapter XI, Theorem 5.1], but is established 
and used in his proof. 

Now we are ready to provide a 

Proof of Proposition 12.21 
According to equations (5) and (5') 

d 2 p(x )(v) 2 = 2£ f\(X',X') - (R{i,X)i,X))dt =2£(X'(£),X(£)), 
Jo 

where X is the unique Jacobi field along 7 such that X(Q) — and X(£) =v (and 
7 : [0, £] — > M is the unique minimizing geodesic, parameterized by arc length, 
such that 7(0) = xq,j(£) = x). Recall that we are assuming d(x, Xq) < r < 
min{i(M),c(M), n/2y/KE}). 

With the help of Proposition 12.41 and the Rauch Comparison Theorem, we are 
going to estimate d 2 tp(xo)(v) 2 . Consider the vector field along 7 defined by 

z(t) - \p{t), 

where Pit) is parallel along 7 and Pi£) — v. Since P is parallel (that is P'(t) — 
Vy( t )P(t) = 0), we have 

z'(t) = \pit), 
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and also 



(Z'(t),Z'(t)) = ±\\v\\l, 



because parallel translation is a linear isometry. 

On the other hand, by our assumption on the sectional curvature, we have 

-(R(y(t),p(t)y/(t),p(t)) < 

Kotfit) A P(t)\ 2 < K4i{t)\\ 2 \\P{t)\\ 2 = K Q \\v\\ 2 . 

Therefore 

i(z, z) = f\z'(t), z'(t)) (ij(y(t), z(t)W(t), z(t))dt = 

Jo 



/() 
1 

1 



£2 I 



(Rtf(t),jP(t)W(t),jP(t))dt = 



\v\\l-1*(Rtf(t),P(t)W(t),P(t))dt< 



/ v 



+ t 2 K \\v\\ 2 dt = 



£ 

-K 

3 ° 



Using Proposition I2.4I we deduce that 



I(X,X)<I(Z,Z) < 



hence that 



d 2 cp(x)(vf = 2H(X,X) < 



2£ 2 
2 + ^K 



\\v\ 



(7). 



In order to conclude the proof we only need to make sure that the left-hand side 
of this inequality is nonnegative. To obtain a lower bound for ||d 2 (^(xo)|| we will 
make use of the Rauch Comparison Theorem stated above. We will compare our 
manifold M with a manifold M of constant curvature equal to _Ko, modelled in the 
same Hilbert space as M is. 

Assume first that X is orthogonal to 7. Take a geodesic 7 : [0, £} — > M of length 
£, and a vector v £ TMjm) orthogonal to 7'(0) with ||«|| = ||X'(0)||. A Jacobi field 
X along 7 with X(0) = and ||X'(0)|| = ||X'(0)|| is given by 



X(t) 



^0 



-P(t), 



where P(t) is the parallel translation of the vector v along 7, with P(0) = v, see 
[TT1 Chapter IX, Proposition 2.12]. Since the sectional curvature of M is less than 
or equal to K Q , X(0) = = X(0) but X{t) ^ ^ X(t) for < * < £ (recall that 
7,7, having lengths less than i(M), do not have conjugate points, see [TTI Theorem 
3.1 of Chapter IX]), ||X'(0)|| = ||X'(0)||, X is orthogonal to 7 and X is orthogonal 
to 7, we get from the Rauch Comparison Theorem that 

(xy),x(t)) < (x'(t),x{t)) 
(X(t),X(t)) ~ (x(t),x(t)) 
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for all t € (0,4 Since \\P(t)\\ = \\v\\ = \\X(£)\\ and X'{t) = cas{*/K^t)P(t), we 
deduce, by taking t — £, that 

r—cos{^K^) {X'{t),X{l)) (X'(l),X(£)) 



i(V^) ~ (X(£),X(£)) 
hence 



(x\e),x(e))>^K cos ^ £ h v\\l. (s) 

In particular, since £ < r < ir we obtain that 

(x'(£),x(e)) >o. 

On the other hand, if X is tangent to 7 then 

x(t) = ±^|| u |uy(i) ) 

and 

X'(t)=±- i \\v\\ xl '(t), 

hence we also have 

(X'(£),X(£)}>0. 

Now, from Propositions 2.3 and 2.4 of Chapter IX of [TT], we know that every 
Jacobi field X along 7 with X(0) = can be written in the form 

X = X T +X X , 

where X T and X 1 - are Jacobi fields along 7, X T and (X T )' are tangent to 7, 
and X- 1 and (X- 1 -)' are orthogonal to 7. In particular (X T , (X )') = and 
(X- 1 , (X T )'> = 0. This implies that 

(X'(t),X(t)) = <(X T )'(i), X T (<)) + <(X X )'(i), X x «) > 0, 

and hence 

d 2 tp{x)(v) 2 = 2£{X'{l),X{l)) > 0, (9) 

in any case. 

By combining (9) with equation (7) above we finally get 

0<rf 2 ^)M 2 <(2 + yifo) \\v\\l 
for all v € TM X , which implies 

2f 2 

\\<?y{x)\\ x < (2 + —K d(x,x ) 2 ), 



and the proof of Proposition ^. 2l finishes. □ 

Corollary 2.6. If i(M) > 0, c(M) > 0, and the sectional curvature of M is 
bounded, then M is second order uniformly bumpable. 

Proof. Fix any r > with r < min{i (M), c(M)}, and define R = 46 + 2Kor 2 , 
where Kq is a bound for the sectional curvature of M . For every S £ (0, r) find a 
C°° function : M — > [0, 1] such that 

(i) 0(t) = 1 for t < 

(ii) 9(t) = for i > 5 2 

(iii) Halloo < 3/S 2 

(iv) Klloo < 10/S 4 . 
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b(x) 



Now, for a given z £ M, consider the function (f(x) — d(x, z) 2 , and define b : M — > 
[0, 1] by 

'0{<p{x)) iid(x,z)<S, 
otherwise. 

It is clear that b is C°° smooth on M and b satisfies conditions (i) and (ii) of 
Definition O 

In order to estimate ||t26||ao) nrs t note that, since x i— > d(x, z) is 1-Lipschitz, we 
have 

\\dip{x)\\ < 2d(x,z) (8) 
for all x £ B(z, r). Then, for all x £ B(z, 5), 

\\db(x)\\ = \e>(v(x))\\\d<p(x)\\<^26=~, 

and therefore 

ii ii 6 R 

||tft||oo < £ < J, 

so condition (m) of the Definition is met as well. On the other hand, by using 
again (8), as well as Proposition ^. 2\ we can estimate ||d 2 6||oo as follows. For every 
x £ B(z,5) we have 

\d 2 b(x)(v) 2 \ = \9"{ip.{x)) (dip(x)(v)) 2 + 6'{ V {x))d 2 V (x){v) 2 \ < 

\e"{ip{x)) {d^{x){v)f | + \6'{ip{x))d^{x){vf\ < 



S^ S)+ p{ 2+ 3 Kor J' S 2 
which implies 

2 46 + Mfbr* i? 
||d vlloo < j 2 = ^ 

that is condition (iv) of Definition 12.11 is also satisfied. □ 



3. The rest of the proof 

In this section we will show that every complete Riemannian manifold which is 
second-order uniformly bumpable satisfies the natural translation to the Riemann- 
ian setting of the second-order DGZ smooth variational principle (established in 
[5] for all C 2 smooth Banach spaces). The result will be a consequence of several 
auxiliary lemmas. 

Lemma 3.1. Let M be a second order uniformly bumpable Riemannian manifold. 
Then there are numbers C > l,r > such that for every p £ M, e > and 
S £ (0, r) there exists a C 2 smooth function b : M — > [0, s] such that: 

(1) b(p) = s= \\bWoo := sup xeM \b{x)\. 

(2) Ud&Hoo := sup xeM \\db(x)\\ x < Ce/S. 

(3) ||d 2 fo||oo := sn PxeM \\d 2 b(x)\\ x < Ce/S 2 . 

(4) b(x) =0ifx£' B(p,S). 

In particular, max{\\b\\oo, Ud&Hoo, ||d 2 6||oo} < Ce(l + 1/5 + 1/5 2 ). 

Proof. In the case when s = 1 we get a required b from the definition of second 
order uniform bumpability. If e ^ 1, it is enough to multiply b by e. □ 
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Lemma 3.2. The space Y = {p £ C 2 (M, R) : ip, \\dip\\ , \\d 2 ip\\ are bounded on M}, 
endowed with the norm 

\\ip\\ Y := maxlll^Hoo, Halloo, 1 1 c? 2 <p 1 1 oo 1 , 

is a Banach space. 

Proof. The space (Y,\\ ■ \\y) is clearly a normed space. Let us check that it is 
complete. 

Let (tp n ) be a Cauchy sequence in (Y, || • \\y)- Since ||<£n||y > ||<p n ||oo and the 
space of continuous bounded functions on M with the norm || • ||oo is complete, we 
know that there exists a continuous bounded function ip : M — > M such that 

\\tfn - f\\oo — > 0. 

We have to see that ip £ C 2 (M), dip and d 2 ip are bounded, and \\d<p n — dip\\ oo — » 0, 
and \\d 2 tp n — d 2 tp\\ 00 — » 0. To this end, fix a point ir e M and a number r with 
< r < i M (aO and such that both cxp^, : B(0 x ,r) C TM X -» B(x,r) C M 
and its inverse exp" 1 are 2-Lipschitz diffeomorphisms, and consider the functions 
^(y) = *Pn° exPx and = V ex Pa;7 defined on the ball S(0 x ,r) in TM X . We 
have 

dip n (w y )(v) = d<p n (y)(dexp x (w y )(v)), (1) 

and 

di>(w y )(y) = dip(y)(dexp x (w y )(v)), (2) 
where we denote w y = exp~ 1 (y). Therefore 

sup \\dip m (wy)-dip n (w y )\\ < 2 sup ||c^ TO (y)-<fy>„(y)|| < 2||dv? m -d<p n || 00 

«J y eS(O a; ,r) yeB(x,r) 

and, because (<p„) is a Cauchy sequence in Y, the definition of \\-\\y implies that the 
right-hand side of the above inequality goes to as to, n — > oo, which shows that 
(tpn) is a Cauchy sequence in the space {/ £ C 1 (i3(0,r)) : / and df are bounded } 
with the norm ||/|| = maxjJI/Hoo, ||c?/||oo}- Since this space is complete it follows 
that tp n converges to some ip £ C 1 (B(0, r)), and dip n converges to dip, in the norm 
|| • ||oc. On the other hand we already know that \\<p n — p\\oo — ► 0, which implies 
\\ipn — "01 |oo — > 0, so ip — ip by the uniqueness of the limit. Therefore ip — ip o exp" 1 
is C 1 on B(x,r), and since x is arbitrary it follows that ip £ C 1 (M). 

To see that d<p n converges to dtp in the norm || • ||oo, let us first observe that, equa- 
tions (1) and (2), together with the facts that dexp x (w y ) is a linear isomorphism 
and dip n — > dip, imply that 

\\d^n(y) - dp(y)\\ y ^ 0, (3) 

that is d<p n — > pointwise on TM* . Now, since cfy?„ is a Cauchy sequence in the 
norm || • ||oo, for every e > there exists % £ N such that 

\\d<Pn(y) ~ d<p m {y)\\y < £ (4) 

for all y £ M, whenever n,m> n . By taking limits as to — > oo in (4), and using 
(3) and continuity of || • H^, we deduce that 

\\dp n {y) -dtp(y)\\ y < e 

for all y £ M, whenever n > uq. This shows that \\dcp n — dv'lloo — ► 0. 

In order to check that </? e C 2 (M) and d 2 (^„ — > d 2 <p, we need to use the following 
Fact, which relates the second derivatives of ip n and <p n . 
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Fact 3.3. Let f : M —>■ K be a C 2 smooth function, and define h = f o exp x on a 
neighborhood of a point £ TM X . Let V be a vector field defined on a neighborhood 
of in TM X , and consider the vector field defined by V(y) — dexp x (w y )(V(w y )) 
on a neighborhood of x, where w y := exp 2 T 1 (y), and let 

a y (t) = exp x (w y +tV(w y )). 

Then we have that 

D 2 h(V, V)(w y ) = D 2 f(V, V){y) + (V/(y), <(0)). 

The proof of this fact is just a calculation, see [H Lemma 2.7]. We apply this with 
V(w) = v, a constant field on TM X , f = cp n , h = tp n to obtain 

d 2 ip n (w y )(v,v) = d 2 tp n (y)(dexp x (w y ){v),dexp x (w y )(v)) + (V<p n (y), a£(0)), (5) 

where 

(T y (t) = exp x (iu H + tv). 
Since the vector field M3|/h cr'' (0) £ TM is continuous and c"(0) = 0, we can 
assume without loss of generality that r > is small enough so that || c^' (0) 1 1 y < 1 
for all y £ B(x, r). Then, again using the fact that exp x is 2-biLipschitz on B(x, r), 
we get from (5) that 

sup \\d 2 ip n (w y ) - d 2 ip m (w y )\\ < 

»„eB(0,r) 

2 sup \\d 2 ip n {y) - d 2 ip n (y)\\ + sup \\dip m (y) — d(p n (y)\\ < 

y£B(x ,r) y£B(x,r) 
2\\d 2 tp n - d 2 Vm||oo + \\dlfin ~ dlfimWoo, 

which implies that (^„) is a Cauchy sequence in the space {/ £ C 2 (B(0,rj) : 
f, df, and d 2 f are bounded } with the norm ||/|| = max{||/|| 0O , Halloo, ||d 2 /l|oo}- 
This space is well known to be complete, hence there exists some ip £ C 2 (B(Q,r)) 
such that ipn converges to ip^ an d d 2 ip n converges to d 2 ip, in the norm || • W^. 
Since we already know that \\ip n — ip\\oo —> 0j w e get that ip = ip, and therefore 
if = ip ° exp" 1 is C 2 on B(x,r). It follows that <p £ C 2 {M). 

Moreover, equation (5) (and the same equation replacing (p n with ip and ip n 
with ip), together with the facts that d exp K (w y ) is a linear isomorphism, and that 
dfn — > (p, imply that 

\\d 2 <f n (y) - d 2 (f(y)\\ y -> 0, 
for each y £ B(x,r), that is d 2 <p n —> d 2 (p pointwise on T2 tS (M). 

By combining this with the fact that (d 2 Lp n ) is a Cauchy sequence in the norm 
|| ■ Hoc, one can easily deduce (as in the case of (dip n )) that ||d 2 </5„ — d 2 <^||oo — > 0. □ 

In the sequel B((p,r) stands for the open ball of center tp and radius r in the 
Banach space Y. 

Lemma 3.4. Let M be a complete metric space, and (Y, || • ||) be a Banach space 
of real-valued bounded and continuous functions on M satisfying the following con- 
ditions: 

(1) IMI ^ || V II oo — su P{|'y 9 ( a; )l : x S M} for every ip £ Y . 

(2) There are numbers C > l,r > such that for every p £ M, s > and 
S £ (0, r) there exists a function b £ Y such that b(p) — e, ||6||y < Ce(l + 
1/5 + l/S 2 ), and b(x) = if x & B(p,S). 
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Let f : M — > RU {+00} be a lower semicontinuous function which is bounded below 
and such that Dom(f) = {x £ M\f(x) < +00} 7^ 0. Then, the set G of all the 
functions tp £ Y such that f + tp attains a strong minimum in M contains a G$ 
dense subset ofY . 

Proof. The proof follows the lines of that of Lemma 3.13 in [2] (which in turn is very 
similar to the original proof of [7]), with some small changes. We write the complete 
proof (rather than just indicating the changes) for the reader's convenience and for 
completeness. 

Take a number N £ N such that N > 1/r, and for every n £ N with n > N, 
consider the set 

U n = {tp £ Y\ 3x Q £ M : (/ + tp)(x ) < inf{(/ + tp)(x)\x £ M\B(x , -)}}. 

n 

Claim 3.5. U n is open. 

Take tp £ U n . By the definition of U n there exists Xq £ M such that (/ + tp){xo) < 
M{(f + tp)(x)\x £ M\B(x , i)}. Set 2p = inf{(/ + v ){x)\x £ M\B(x , 1)} - (/ + 
<p)(x ) > 0. Then, since || • || y > || • H^, we get that B Y (<p,p) C Boo((p,p) C J7 n . 

Claim 3.6. C/ n is dense in Y. 

Take y S V and e > 0. Since / + ip is bounded below there exists xo € M such 
that (/ + ip)(xo) < inf{(/ + <p){x)\x £ M} + e. Set now 6 = 1/n < r, and use 
condition (2) to find a function b £ Y such that b(xo) — e, \\b\\y < C(n 2 + n + l)e, 
and&(x) =0forx^B(x ,i). Then (/ + ip)(x ) - b(x ) < inf{(/ + tp)(x)\x £ M} 
and, if we define h = —b, we have 

{f + <p + h)(x ) < inf{(/ + <p)(x)\x £ M} < inf{(/ + <p)(x)\x & B(x , -)}. 

n 

Since in£{(f + ip)(x)\x £ B(xq, i)} = inf{(/ + ^ + /i)(x)|.T ^ i?(a; , ^)}, it is obvious 
that the above inequality implies that ip + h £ U n . On the other hand, we have 
11^-11 y < C(n 2 +n + l)e. Since C and n are fixed and e can be taken to be arbitrarily 
small, this shows that ip £ U n , and U n is dense in Y. 

Therefore we can apply Baire's theorem to conclude that the set G = H^Lat U n 
is a Gs dense subset of Y. Now we must show that if ip £ G then / + tp attains 
a strong minimum in M. For each n > N, take x n £ M such that (/ + ip)(x n ) < 
inf{(/ + ip)(x)\x ^ B(x n , —)}. Clearly, Xk £ B(x n , -) if k > n, which implies that 
(3; n )™ =JV is a Cauchy sequence in M and therefore converges to some xq £ M . Since 
/ is lower semicontinuous and fX^ =N B(xq, 1/n) — {xq}, we get 

(/ + <p){x ) < liminf(/ + tp){x n ) < liminf [inf {(/ + <p)(x)\x £ M \ B(x , -)}] 

n 

= inf{inf{(/ + <p)(x)\x £ M\B(x , -)} : n £ N, n > N} 

n 

= inf{(f + V )(x)\x£M\{x }} : 

which means that / + ip attains a global minimum at xq £ M. 

Finally, let us check that in fact / + <p attains a strong minimum at the point 
x . Suppose {y n } is a sequence in M such that (/ + g){y n ) — ► (/ + <?)(#o) an d 
(y n ) does not converge to xq. We may assume d(y n ,Xo) > s for all n. Bearing in 
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mind this inequality and the fact that xo = limir n , we can take k £ N such that 
d(xk,y n ) > h f° r au n i an d therefore 

(/ + <p)(xo) < (/ + <p)(x k ) < inf{(/ + y){x)\x £ B(x k , i)} < (/ + ^)(y„) 
for all n, which contradicts the fact that (/ + <p)(y n ) — > (/ + <^)(a;o). D 

By combining Lemmas 13.11 13.21 and 13.41 we immediately deduce the following. 

Theorem 3.7. Let M be a second order uniformly bumpable complete Riemannian 
manifold. Then, for every lower semicontinuous function f : M — > (—00,00] which 
is bounded below, with f ^ +00, and for every e > 0, there exists a C 2 smooth 
function tp : M — > M such that 

(1) / — (p attains its strong minimum on M 

(2) Halloo < £ 

(3) Ht^Hoo <E 

(4) lld^lloo <e 

Finally, Theorem 11.11 follows from Theorem 13 . 71 and Corollarv l2.6l 
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